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Respiratory syncytial virus BA genotype has reportedly replaced 
other group B genotypes worldwide. We report the observation of 
three group B viruses, all identical in G sequence but lacking the BA 
duplication, at a coastal district hospital in Kenya in early 2012. This 
follows a period of six consecutive respiratory syncytial virus (RSV) 
epidemics with 100% BA dominance among group B isolates. The 
new strains appear only distantly related to BA variants and to 



previously circulating SAB I viruses last seen in the district in 2005, 
suggesting that they were circulating elsewhere undetected. These 
results are of relevance to an understanding of RSV persistence. 

Keywords Attachment (G gene), BA genotype, genetic diversity, 
respiratory syncytial virus. 



Please cite this paper as: Agoti et al. (2013} Identification of group B RSVs that lack the 60-nucleotide duplication after 6 consecutive epidemics of total BA 
dominance at coastal Kenya. Influenza and Other Respiratory Viruses 7(6), 1008-1012. 



Background 

Human respiratory syncytial virus (RSV) is the leading viral 
cause of acute lower respiratory tract illnesses in infants and 
young children worldwide' and is associated with annual or 
biannual epidemics.^ RSV isolates can be classified into two 
genetically and antigenically distinct groups, A and B, within 
which further diversity is reported.^^ RSV repeatedly re-infects 
individuals throughout life, and the strains involved are often 
genetically distinct."' The differences occurring both between 
and within the groups are most pronounced at the attach- 
ment (G) protein gene, which encodes a surface-expressed 
protein of the virus known to be targeted by the host 
neutralizing antibody response.' 

In the last two decades, sequencing and phylogenetic analysis 
of the G gene have allowed identification of several genotypes 
within the two groups.^ It has also been noted that the 
circulating diversity within the groups is dynamic, and new 
genotypes periodically emerge, but also some previously 
circulating genotypes appear to have become extinct.^ Locally, 
over successive epidemics, predominant genetic variants within 
the groups become replaced.' Continued observation of the 
dynamics of circulating RSV genetic and antigenic diversity is 
essential for the design of better RSV control approaches. 



A new group B strain with a 60-nucleotide duplication 
within the G gene was first observed in Buenos Aires, 
Argentina, in 1999 (named BA genotype) and spread around 
the world in 3-4 years.' Interestingly, the BA genotype 
appeared to replace all the other group B genotypes in RSV 
epidemics that occurred in the second half of the last 
decade.^ The cause of the remarkable BA genotype epidemi- 
ological success remains unknown.^ Few non-BA cases were 
observed after lOOS,**"'" these occurring in Cambodia, Brazil 
and China between 2007 and 2009. The epidemiological 
dynamics of the BA genotype have indicated that (i) new 
RSV strains can transmit to all parts of the world in a 
relatively short time period, and (ii) there is ongoing within- 
group competition between RSV genotypes. Monitoring the 
transmission and evolutionary dynamics of group B RSV 
strains after the BA genotype emergence allow the study of an 
RSV genotype from emergence to, potentially, its extinction. 

Hospital-based RSV surveillance at coastal Kenya, 2002-to 
date, was established with the objectives of documenting RSV 
disease burden and RSV molecular and immuno-epidemiol- 
ogy" within the region. The surveillance period overlaps 
with the period when changes have been observed in 
circulating RSV B genotypes around the world. Here, we 
report the results from sequencing and genotyping of all 
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group B RSVs that we identified over the 11 years of the 
surveillance, and the results of analysis of novel non-BA 
strains we observed in three patients infected in early 2012. 

Methods 

The samples analyzed were obtained from under five-year- 
olds admitted with pneumonia to the Kilifi District Hospi- 
tal'^ and span the period between January 2002 and mid- 
2012 (the latter marking the end of the 2011-2012 epidemic). 
Either a parent or a guardian provided informed consent on 
behalf of each child. The specimens (nasal washings, 
aspirates, or swabs) were screened for RSV using an 
immunofluorescence assay (IFAT), and samples positive for 
RSV were subtyped into groups A and B using a conventional 
multiplex RT-PCR or a multiplex real-time PGR assay. 
Group B RSV-positive samples were sequenced in the 
ectodomain region of the G gene as previously described.* 

Sequences were aligned using bioedit software (http:// 
www.mbio.ncsu.edu/bioedit/bioedit.html). Phylogenetic anal- 
yses were performed in mega 5.1 program (http://www. 
megasoftware.net/). The dates to the most recent common 
ancestor (MRGA) between sequence clusters were estimated 
using Bayesian methods within the beast software version 
1.74,'^ and convergence was confirmed in tracer program 
version 1.5. Representative strains of previously identified 
group B genotypes were included in the analysis to help 
classify the Kilifi strains. 

Results and discussion 

Over the period January 1, 2002, to June 31, 2012, 574 group 
B RSVs were identified among the RSV IFAT-positive 
samples, and 488 (85-0%) of these were successfully 
sequenced in the G ectodomain region. The genotyping 
results over the 1 1 consecutive epidemics are summarized in 
Table 1. Early in the surveillance, BA strains were detected as 
a rare genotype, making only 7-3% (3 of 41) of the sequenced 
group B genotypes between January 2002 and June 2004. The 
prevalence of BA strains dramatically increased to 97-3% 
(108 of 111) in the 2004-2005 epidemic, and thereafter, all 
the group B samples identified and sequenced between June 
2005 and December 2011 possessed the BA duplication. 
However, in sequencing B specimens from the first half of 
2012, three lacked the BA duplication. This was an 
unexpected finding occurring after multiple epidemics of 
non-BA genotypes absence; thus, we undertook further 
analysis of these three sequences to ascertain their possible 
origin and phylogenetic relationship with other RSV 
sequences deposited in public databases. 

The three patients were admitted on March 1, on March 
23, and on May 16, 2012, and were aged 14-5, 1-7, and 5-5 
months, respectively, at the time of admission. Two of the 



Table 1. Tine distribution of the different group B genotypes at Kilifi 
during tlie 1 1 consecutive epidemics of tlie surveillance 



Total 

Peak group B 

epidemic strains 
month identified 



No. 

group B 

sequenced 

(%)* 



Genotypes Identified 
among the sequenced 
group B strains 



SAB1 BA (%)** 



New 
type 



May 2002 


1 


1 (100-0) 


1 


0(0) 


0 


Jan 2003 


33 


27 (81-8) 


25 


2 (7-4) 


0 


Apr 2004 


18 


13 (72-2) 


12 


1 (8-3) 


0 


Jan 2005 


117 


1 1 1 (94-9) 


3 


108 (97-3) 


0 


Mar 2006 


9 


9 (100-0) 


0 


9 (100) 


0 


Jan 2007 


22 


19 (86-2) 


0 


19 (100) 


0 


Jan 2008 


158 


140 (88-6) 


0 


140 (100) 


0 


Feb 2009 


29 


14 (48-2) 


0 


14 (100) 


0 


Mar 2010 


81 


60 (74-1) 


0 


60 (100) 


0 


Feb 2011 


16 


1 5 (93-8) 


0 


15 (100) 


0 


Mar 2012 


90 


79 (87-8) 


0 


76 (96-2) 


3 


Total 


574 


488 (85-0) 


41 


444 (91-0) 


3 


*Numbers 


in brackets represent the proportion of group 


B RSV- 



positive samples that were sequenced of all those that were identified 
and subtyped as group B. 

**Numbers in brackets represent percentage of the sequenced group 
B genotypes that genotyped into BA. 

Peak epidemic month given by the three letter month abbreviation. 



patients were admitted with severe pneumonia, and the third 
patient had very severe pneumonia (see ref " for syndromic 
definitions). All three nucleotide sequences (KEN/201/Mar/ 
2012, KEN/202/Mar/2012 and KEN/203/May/2012) obtained 
from the three patients were identical across the entire 633 
nucleotides sequenced. 

A BLAST search to track their ancestry (http://blast.ncbi. 
nlm.nih.gov/Blast.cgi) querying the second hypervariable 
region of G gene found that these three non-BA sequences 
were closely related to strains from Brazil collected in 
between 2004 and 2007 with the top matches being those 
isolated in 2007. These Brazilian 2007 strains were non-BA 
type and have been classified under the GB3 genotype in their 
GenBank annotation. 

The phylogenetic relationship of the KUifi 2012 non-BA 
sequences, the other group B genotypes identified at Kilifi 
(BA and SABl), the top 25 of the blast hits, and represen- 
tative sequences of aU previously described group B 
genotypes around the world is given in Figure 1. The Kilifi 
2012 non-BA sequences and the previously circulating non- 
BA sequences at Kilifi (SABl) fell on separate branches that 
were distant from each other. The Kilifi 2012 non-BA 
sequences and the non-BA sequences that were isolated in 
Gambodia (2008-2009, designated SAB4'') and China (before 
2005) alongside the Brazilian sequences (2004-2007) fell into 
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one cluster but with only moderate support (bootstrap value 
50%) although clearly separated from the other group B RSV 
genotypes, Figure 1. 

The average between-group genetic distances between the 
Kilifi 2012 non-BA sequences and the Brazilian, Chinese, and 
Cambodian non-BA viruses were 2-8%, 6-4%, and 94%, 
respectively. These distances imply significant diversification 
between these sequences despite the relatively close 
relationship on the tree. The MRCA analysis predicted the 
KUifi 2012 non-BA viruses and that the Brazilian viruses 
diverged fi'om June 2007 (95% HPD, January 2002-February 
2012), while the divergence with the Cambodian 2008-2009 
non-BA viruses was predicted to have occurred much earlier 
- August 2000 (95% HPD, February 1985-May 2009). 
However, these estimates should be interpreted with caution 



as the HPD confidence intervals are very wide and 
overlapping. 

Within the carboxy terminus region of the encoded G 
protein, the Brazilian 2007 viruses and the Kilifi 2012 non- 
BA viruses differ by six amino acids. Figures 2 and 3. 
Interestingly, three of these changes affect sites that were 
previously shown to be under positive selection in group B 
j^gy 13,14 jj^^^ positions 223, 251, and 258. Studies with 
RSV A strains have previously noted that multiple epitopes 
lie in this region of the RSV G gene recognized by human 
convalescent sera, and changes affecting some particular 
positions, some of which were identified as positively 
selected, can have profound effect on the overall antibody 
recognition.^ Thus, these substitutions in the 2012 non-BAs 
could be of considerable significance. 
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New 201 2 Type 
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Figure 1. A maximum-likelihood tree of 53 sequences including the top 25 biast hits, the 3 new non-BA Kilifi sequences, 10 sequences representative 
strains previously collected at Kilifi (both BA and non-BA), and 1 5 representative sequences of all the previously described group B genotypes. The tree is 
derived from an alignment of the carboxy terminus region nucleotide sequences of the RSV G protein gene [264-nt-long region (324 for BA)]. The nodes 
of branches with exclusively Kilifi sequences are indicated with a red-filled square box, and the associated branches of these Kilifi sequences are also 
colored in red. The nomenclature of the taxa is a three-letter code for the country of sequence origin/GenBank accession number/strain name. 
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Figure 2. The figure shows an alignment of partial G-protein-predicted amino acid sequences for the Brazilian, Chinese, and Cambodian non-BA 
sequences that were identified in the second half of the last decade and the Kilifi non-BA sequences [both the new 2012 and the old (SAB1 strains)]. The 
sequences are compared with GB1 strain (USA/CHI Ob) given at the top of the alignment as a reference strain. Where the amino acid sequence is similar 
to the GB1 strain sequence, the position is shown with dot along the column otherwise implies a mutation. The asterisks denote stop codons. At the 
bottom of the columns, positions previously identified as positively selected in group B RSV are indicated with a plus (+) symbol. ^^'^ Gray-highlighted 
columns show amino acid sequence positions that changed and were also identified to be positively selected. The taxa nomenclature is a three-letter code 
for the country of sequence origin/GenBank accession number/strain name. 
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Figure 3. Panel (A) shows a maximum-likelihood 
tree from 232 sequences comparing 599 
nucleotide positions of the RSV G protein gene. 
This includes all unique BA sequences observed at 
Kilifi between 2003 and 2012 (224 sequences), the 
3 Kilifi 2012 non-BA strains (marked red), and 5 of 
the close Brazilian non-BA sequences collected in 
2007 that span this length (marked green). In the 
alignment used to generate this tree, the 60- 
nucleotide insertion of the BA strains was removed 
to investigate whether the remaining nucleotide 
positions were highly similar to BA, which could 
then suggest that the new non-BA strains emerged 
by spontaneous deletion of this region from BA 
genotype. However, we observe that these new 
Kilifi non-BA strains cluster far away on the tree 
from the current BA strains and are close to the 
Brazilian non-BA strains - boxed in (a) with a 
dashed rectangle and expanded as panel (B). Panel 
(C) shows the seven amino acid changes in the Kilifi 
2012 non-BA strains from the Brazilian 2007 
strains and their respective positions that they were 
observed (six of the seven changes occur in the 
second hypervariable region). 
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We have previously reported the detection of minority 
genomes within two patients sampled in 2008 that showed a 
60-nucleotide deletion at the exact position where the 
insertion occurred in the BA variant, and we speculated this 
as a potential mechanism by which strains without the 
duplication could re-emerge.'^ Our phylogenetic analysis 
comparing the new non-BA strains with the circulating BA 
strains at Kilifi from which the 60-nucleotide duplication 
sequence was excised found the new strains were only 
distantly related to BA; that is, they do not represent BA 
strains that have recently lost their duplication (Figure 3). 
This observation of wild-type non-BA viruses still in 
circulation provides a mechanism for replacement of the 
BA strains if the BA selective advantage diminishes. 

In conclusion, we report the total dominance of the BA 
genotype at Coastal Kenya among group B isolates over six 
epidemics and re-appearance of non-BA strains during the 
first half of 2012. These new non-BA strains appear to have 
been circulating globally, albeit as a minority genotype, for 
several years and are not emerging de novo from the BA 
genotype. Compared with their closest relatives in GenBank, 
the new strains were found to have undergone changes at 
positions believed to be located in immune epitopes, and it will 
be interesting to observe whether these variants will increase in 
prevalence over time and succeed in dominance over the BA 
genotype. These observations raise questions on the amount of 
undetected RSV diversity existing and the mechanisms driving 
changes to population-level RSV diversity. 

GenBank accession numbers 

The G sequences of the Kilifi strains reported in this study are 
deposited into GenBank under the accession numbers 
KC263040-KC263051. 

Acknowledgements 

We thank the Viral Epidemiology and Control (VEC) team 
at Kilifi for collecting and genotyping the samples analyzed 
here. This study is published with permission of the director 
of KEMRI. The work was supported by Wellcome Trust 
grants 084633 and 084538. 



References 

1 Collins PL, Melero JA. Progress in understanding and controlling 
respiratory syncytial virus: still crazy after all these years. Virus Res 
2011; 162:80-99. 

2 White LJ, Mandl JN, Gomes MG ef al. Understanding the transmis- 
sion dynamics of respiratory syncytial virus using multiple time series 
and nested models. Math Biosci 2007; 209:222-239. 

3 Cane P. Molecular epidemiology and evolution of RSV; in Cane P 
(ed.): Respiratory Syncytial Virus. Amsterdam: Elsevier, 2007; 89-1 13. 

4 Agoti CN, Mwihuri AG, Sande CJ ef al. Genetic relatedness of 
infecting and reinfecting respiratory syncytial virus strains identified in 
a birth cohort from rural Kenya. J Infect Dis 2012; 206:1532-1541. 

5 Trento A, Viegas M, Galiano M et al. Natural history of human 
respiratory syncytial virus inferred from phylogenetic analysis of the 
attachment (G) glycoprotein with a 60-nucleotide duplication. J Virol 
2006; 80:975-984. 

6 Trento A, Casas I, Calderon A et al. Ten years of global evolution of 
the human respiratory syncytial virus BA genotype with a 60- 
nucleotide duplication in the G protein gene. J Virol 2010; 84: 
7500-7512. 

7 Sande CJ, Mutunga MN, Medley GF, Cane PA, Nokes DJ. Group- and 
genotype-specific neutralizing antibody responses against respiratory 
syncytial virus in infants and young children with severe pneumonia. 
J Infect Dis 2013; 207:489^92. 

8 Zhang RF, Jin Y, Xie ZP ef al. Human respiratory syncytial virus in 
children with acute respiratory tract infections in China. J Clin 
Microbiol 2010; 48:4193^199. 

9 Arnott A, Vong S, Mardy S et al. A study of the genetic variability of 
human respiratory syncytial virus (HRSV) in Cambodia reveals the 
existence of a new HRSV group B genotype. J Clin Microbiol 2011; 
49:3504-3513. 

10 Zhang ZY, Du LN, Chen X et al. Genetic variability of respiratory 
syncytial viruses (RSV) prevalent in Southwestern China from 2006 to 
2009: emergence of subgroup B and A RSV as dominant strains. J Clin 
Microbiol 2010; 48:1201-1207. 

1 1 Nokes DJ, Ngama M, Bett A ef al. Incidence and severity of respiratory 
syncytial virus pneumonia in rural Kenyan children identified through 
hospital sun/eillance. Clin Infect Dis 2009; 49:1341-1349. 

12 Drummond AJ, Rambaut A. BEAST: Bayesian evolutionary analysis by 
sampling trees. BMC Evol Biol 2007; 7:214. 

13 Zlateva KT, Lemey P, Moes E, Vandamme AM, Van Ranst M. Genetic 
variability and molecular evolution of the human respiratory syncytial 
virus subgroup B attachment G protein. J Virol 2005; 79:9157-9167. 

14 Woelk CM, Holmes EC. Variable immune-driven natural selection in 
the attachment (G) glycoprotein of respiratory syncytial virus (RSV). 
J Mol Evol 2001; 52:182-192. 

15 Agoti CN, Mbisa JL, Bett A, Medley GF, Nokes DJ, Cane PA. 
Intrapatient variation of the respiratory syncytial virus attachment 
protein gene. J Virol 2010; 84:10425-10428. 



1012 



© 2013 John Wiley & Sons Ltd. Influenza and Other Respiratory Viruses published by John Wiley & Sons 



